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ON THE TAILS OF THE LIMITING QUICKSORT 
DISTRIBUTION 

SVANTE JANSON 


Abstract. We give asymptotics for the left and right tails of the lim¬ 
iting Quicksort distribution. The results agree with, but are less precise 
than, earlier non-rigorous results by Knessl and Spankowski. 


1. Introduction 


Let Xn be the number of comparisons used by the algorithm Quicksort 
when sorting n distinct numbers, initially in a uniformly random order. 
Equivalently, is the internal pathlength in a random binary search tree 
with n nodes. (See e.g. Knuth [7, Sections 5.2.2 and 6.2.2] or Drmota [1, 
Chapter 8 and Section 1.4.1] for a description of the algorithm and of bi¬ 
nary search trees.) It follows that Xn satisfies the distributional recurrence 
relation 

Xn = -b X*_u^ + n — 1, n > 1, (1.1) 

where = denotes equality in distribution, and, on the right, Un is distributed 
uniformly on the set {1,..., n}, X* = Xj, Xq = 0, and LQ, Aq, ..., Xn-i, 
Aq, ..., X*_^ are all independent. (Thus, (1.1) can be regarded as a defini¬ 
tion of Xn-) 

It is well-known, and easy to show from (1.1), that 


KXn = 2{n + l)Hn — 4:71 ^ 2nlnn, (1-2) 

where Hn '■= X]fc=i n:th harmonic number. Moreover, it was 

proved by Regnier [9] and Rosier [10], using different methods, that the 
normalized variables 


A„ - E A„ 

:= ^ 1 

n 

converge in distribution to some limiting random variable Z, as n —)• oo. 

There is no simple description of the distribution of Z, but various re¬ 
sults have been shown by several different authors. For example, Z has an 
everywhere finite moment generating function, and thus all moments are 
finite [10], with E Z = 0 and Var Z = 7 — Ivr^; furthermore, Z has a density 
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which is infinitely differentiable [11; 2], Moreover, the recurrence relation 
( 1 . 1 ) yields in the limit a distributional identity, which can be written as 

Z = UZ' + {l-U)Z" + g{U), (1.4) 

where U, Z' and Z" are independent, \J U(0,1) is uniform, Z',Z" = Z, 
and g is the deterministic function 

g{u) := 2ulnu + 2(1 — u) ln(l — tt) + 1. (1.5) 

Furthermore, Rosier [10] showed that (1.4) together with EZ = 0 and 
VarZ < oo determines the distribution of Z uniquely; see further [3]. The 
identity (1.4) is the basis of much of the study of Z, including the present 
work. 

In the present paper we study the asymptotics of the tail probabilities 
P(Z ^ —x) and F(Z ^ x) as x —>■ oo. Using non-rigorous methods from 
applied mathematics (assuming an as yet unverified regularity hypothesis), 
Knessl and Szpankowski [ 6 ] found very precise asymptotics of both the left 
tail and the right tail. Their result for the left tail is that, as x —)■ oo, with 
7 = ( 2 -I^)-^ 

F{Z ^ —x) = (ci -|-o(l)) exp(— 026 '*'^) = exp(—(1.6) 

where ci,C 2 ,C 3 are some constants (ci is explicit in [ 6 ], but not C 2 ). For 
the right tail, they give a more complicated expression, which by ignoring 
higher order terms implies, for example, 

F{Z ^ x) = exp(—xlnx — xlnlnx -|- (1 + ln2)x -|- o(x)). (1.7) 

It has been a challenge to justify these asymptotics rigorously, and so far 
very little progress has been made. Some rigorous upper bounds were given 
by Fill and Janson [4], in particular 

P(Z ^ x) ^ exp(—xlnx-|-(1-|-ln2)x), x ^ 303, (1.8) 

with the same leading term (in the exponent) as (1.7), and for the left tail 

F{Z ^ —x) ^ exp(—x^/5), X ^ 0, (1-9) 

which is much weaker than ( 1 . 6 ). 

Also the present paper falls short of the (non-rigorous) asymptotics (1.6)- 
(1.7) from [ 6 ], but we show, by simple methods, the following results, which 
at least show that the leading terms in the top exponents in (1.6)"(1.7) are 
correct. 

Theorem 1.1. (i) Let 7 ;= (2 — As x ^ 00 , 

exp(-e^^+^“i“^+^(^)) ^ F{Z ^ -x) ^ exp(-e^^+^(^)) (1.10) 

(ii) As X ^ 00, 

exp(—xlnx — xlnlnx -|- 0(x)) ^ F(Z ^ x) ^ exp(—xlnx -|- 0(x)). (1.11) 
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We show the lower bounds in Sections 3 and 4, and the upper bounds in 
Sections 5 and 6. The lower bounds are proved by direct arguments using 
the identity (1.4); the upper bounds are proved by the standard method of 
first estimating the moment generating function. 

Remark 1.2. The right inequality in (1.11) follows from the more precise 
(1.8), where an explicit value is given for the implicit constant; we include 
this part of (1.11) for completeness. (The proof in Section 6 actually yields a 
better constant than (1.8) for large x, see (6.10).) We expect that, similarly, 
the implicit constants in the other parts of (1.10)“(1.11) could be replaced by 
explicit bounds, using more careful versions of the arguments and estimates 
below. However, in order to keep the proofs simple, we have not attempted 
this. 

Remark 1.3. We consider only the limiting random variable Z, and not Zn 
or Xn for finite n. Of course, the results for Z imply corresponding results 
for the tails ¥‘{Zn ^ —x) and F{Zn ^ x) for n sufficiently large (depending 
on x), but we do not attempt to give any explicit results for finite n. For 
some bounds for finite n, see [5] and (for large deviations) [8]. 

Remark 1.4. Although we do not work with Zn for finite n, the proofs 
below of the lower bounds can be interpreted for finite n, saying that we 
can obtain Zn ^ —x with roughly the given probability (for large n) by 
considering the event that in the first ©(x) generations, all splits are close 
to balanced (with proportions ^ ± say); similarly, to obtain Zn ^ x 

we let there be one branch of length 0(x) where all splits are extremely 
unbalanced (with at most a fraction (xlnx)“^ on the other side). The fact 
that we require an exponential number of splits to be extreme for the lower 
tail, but only a linear number for the right tail, can be seen as an explanation 
of the difference between the two tails, with the left tail doubly exponential 
and the right tail roughly exponential. 

2. Preliminaries 

Note that g in (1.5) is a continuous convex function on [0,1], with maxi¬ 
mum g{0) = g{l) = 1 and minimum <7(1/2) = 1 — 2In2 = —(2In2 — 1) <0. 

Let V’(i) := be the moment generating function of Z. As said above. 
Rosier [10] showed that i/(t) is finite for every real t. The distributional 
identity (1.4) yields, by conditioning on U, the functional equation 

i/(f):=Ee*^= f i/(ttt)V’((l — u)t)e*®*'“^ dtt. (2.1) 

Jo 

We may replace Z by the right-hand side of (1.4); hence we may without 
loss of generality assume the equality (not just in distribution) 

Z = UZ' + {1-U)Z'' + g{U). 


( 2 . 2 ) 
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3. Left tail, lower bound 

Proof of lower bound in (1.10). Let e > 0 be so small that g{^ + e) < 0, and 
let a := —g{^ + e) >0. For any z, on the event {Z' ^ —z, Z” ^ —z, and 


|?7 - i| ^ e}, (2.2) yields 

Z ^ -\Jz - (1 - U)z + g{U) = -z + g{U) ^-z-a. (3.1) 

Hence, for any real z, 

P(Z^-z-a) ^ 2eP(Z^-z)^. (3.2) 

It follows by induction that 

P(Z ^ -no) ^ P(Z ^ 0)^", n ^ 0. (3.3) 

Consequently, using 2£ ^ 1, P(Z ^ —no) ^ (2eP(Z ^ 0))^", and thus, with 
c := ln(2P(Z ^ 0)) > —oo, 

lnP(Z ^ —no) ^ 2"'(lne + c), n ^ 0. (3.4) 

If X > 0, we take n = \x/a\ and obtain 

lnP(Z ^-x) ^ 2*/“+i(lne + c). (3.5) 

We choose (for large x) e = x“^/^, so, using Taylor’s formula, 

a = —g{\ + ^) = = 21n2 — 1 + 0[x~^) (3.6) 

and thus 

a"^ = (21n2-l)"^ + 0(x"^). (3.7) 


Consequently, (3.5) yields 

lnP(Z ^ -x) ^ 2^/(21n2-l)+0{l)('i^^-l/2 ^ _g7x+0(l)+lnlnx^ (3 3^ 

□ 

4. Right tail, lower bound 

Proof of lower bound in (1.11). Let 0 < 5 < ^. If 0 < U ^ 6, then 

g{U) ^ g{5) = 1 + 25In5 + 0(5) ^ 1 + 35 In 5, (4.1) 

with the last inequality holding provided 5 is small enough. 

Assume that (4.1) holds, and assume that Z' ^ 0, Z” ^ z ^ 0 and U ^ 5. 
Then (2.2) yields 

Z ^ (1 — 5)z + g{5) ^ z — 5z + 1 — 351n5“^. (4.2) 

Consequently, 

P(Z ^ z + 1 - 5z - 35 In5”^) ^ 5P(Z ^ 0) P(Z ^ z). (4.3) 

Let X be sufficiently large and choose 5 = l/(xlnx). Then, for 0 ^ z ^ x, 

z + l-5z-351n5-^ ^ ^ + 1 _ + (4.4) 

In X X In X In X 
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provided x is large enough. Hence, if 6 := 1 — and c := P(Z ^ 0) > 0, 
then for 0 ^ ^ X we have 

P(Z ^ z + 6) ^ C(5P(Z ^ z). (4.5) 

By induction, we find for O^n^x/6+1, 

P(Z ^ nb) ^ P(Z ^ 0) = > {c6)^+\ (4.6) 

Consequently, taking n := [x/ft], 

lnP(Z ^ x) ^ (n + l)(lnc + In5) ^ {x/b + 2)(lnc + In5) 

= (x + 0(x/lnx)) (—Inx — Inlnx + 0(1)) 

= —xlnx — xlnlnx + 0(x). (4-7) 

□ 

5. Left tail, upper bound 

Lemma 5.1. There exists a ^ 0 sueh that for all t > 0, with k := 7“^ = 
2_3— 

^ In 2' 

i/)(—t) < exp(Kt In t + at + 1 ). (5-1) 

Proof. We note that tint ^ —e~^ for t > 0, and thus Ktlnt + at + 1 ^ 
—Ke~^ + 1 > 0. Since f^{t) is continuous and i/)(0) = 1, there exists ti > 0 
such that ipi—t) < exp(l — Ke“^) for 0 ^ t ^ ti, and thus (5.1) holds for all 
such t, and any a ^ 0. Next, let t 2 := 7re^. We may choose a > 0 such that 
(5.1) holds for t G [ti,t 2 ]. 

Before proceeding to larger t, define 

h{u) := ulnu + (1 — n) ln(l — u) (5.2) 

and note that g{u) = 2h{u) + 1 by (1.5). 

Now suppose that (5.1) fails for some t > 0 and let T := inf{t > 0 : 
(5.1) fails}. Then T ^ t 2 , and, by continuity, 

ip{—T) = exp(KTlnr + aT + l). (5-3) 

Furthermore, if 0 < a < 1, then (5.1) holds for t = uT and t = (1 — u)T, 
and thus, recalling (5.2), 

- u)T) 

< exp(/tuTln(uT) + k(1 — u)Tln((l — u)T) + auT + a(l — u)T + 2) 
= exp(ATlnT + ^(alna + (1 — u) ln(l — u))T + aT + 2). 

= exp(K;TlnT + Kh{u)T + aT + 2). 

Furthermore, g{u) = 1 + 2h{u), and thus we obtain 

ip{-uT)f;{-{l - u)T)e-^ 3 {u) ^ exp (aT InT - ((2 - K)h{u) + l)T + aT + 2). 

(5.4) 
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By (5.2), h{u) is a convex function with h{^) = — ln2, h'{^) = 0 and 
h"{u) = u~^ + (1 “ u)~^ ^ 4, and thus by Taylor’s formula, h{u) ^ — ln2 + 


2{u — . Furthermore, 2 — k = l/ln2, and thus 


(2 - K)h{u) + 1 ^ ^ - i) 


1^2 


Combining (2.1), (5.4), and (5.5), we obtain 

V^(-T) ^ ^ exp(^KrinT + aT + 2-{u- 

/ OO ^ 

du 

-OO 


du 


= — exp(Krinr + aT + 2). 


(5.5) 


(5.6) 


Since T ^ t 2 = ^re^, this yields '4){—T) < exp(KTlnT + oT + l), which 
contradicts (5.3). This contradiction shows that no such T exists, and thus 
(5.1) holds for all f > 0. □ 


Proof of upper bound in (1.10). For x ^ 0 and any t ^ 0, by Lemma 5.1, 
¥’{Z ^ —x) ^ < exp(—tx + Kf Int + at + l). (5.7) 

We optimize by taking t = exp(K“^(x — a) — 1) and obtain 

lnP(Z ^ —x) < t{Klnt + a — x) + 1 = —Kt + 1 = —e'’’ ^x+o{i)^ 
which is the upper bound in ( 1 . 10 ) because = 7 . □ 


6. Right tail, upper bound 

As said in the introduction, (1.8) was proved in [4]. Nevetheless we give 
for completeness a proof of the upper bound in ( 1 . 11 ), similar to the proof in 
Section 5. (It is also similar to the proof in [4] but simpler, partly because we 
do not keep track of all constants and do not try to optimize; nevertheless, 
it yields a slight improvement of ( 1 . 8 ) for large x, see ( 6 . 10 ) below.) 

Lemma 6.1. There exists a ^ 0 such that for all t ^ 0, 

V'(f) ^ exp(e* + of). ( 6 - 1 ) 

Note that [4, Corollary 4.3] shows the bound V’(i) ^ exp(2e*) for t ^ 5.02, 
which is explicit, but weaker for large t. 

Proof. Since V’(O) = 1 < e, it follows by continuity that there exists fi > 0 
such that fj{t) ^ e for t € [ 0 ,fi], and thus ( 6 . 1 ) holds for t G [ 0 ,fi] and any 
a ^ 0 . 

Let t 2 ■= 100, and choose a so that (6.1) holds for t G [ti,t 2 ]. Assume 
that (6.1) fails for some t > 0, and let T := inf{t > 0 : (6.1) fails}. Then 
T ^ t 2 , and, by continuity. 


V^(r) = exp(e^ + oT) . 


(6.2) 
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Furthermore, if 0 < u < 1, then (6.1) holds for t = uT and t = {1 — u)T, 
and thus, using (2.1) and the symmetry u 1 — u there, and g{u) ^ 1, 


rl/2 

^p{T)^2 / + auT + + a{l — u)T+ Tg{u)] du 

/• 1/2 . . 

^2j exp(^e“'^ + e'^-“^ + aT + rj dw. 

We consider two cases. 

(i) If uT ^ 1, then ^ 1 — ^uT, and thus 

^uT ^ ^T-uT + + T ^ e + e^(l - ^uT) + (a + l)r. 

Hence, the contribution to (6.3) for u ^ 1/T is no more than 

2 J exp^e'^ + (a + 1)T + e — ^Te^uj du 

< 2exp(e^ + {a + l)T + e) 

= ^ exp(e^ + aT) ^ 0.7V’(T), 


(6.3) 


(6.4) 


(6.5) 


by (6.2) and T ^ t 2 = 100, since 4e® = 60.62. 

(ii) For uT > 1 and u < ^, recalling T ^ t 2 = 100, 

+ aT + T ^ 2e'^““^ + aT + T 2e"^e^ + aT + T 
^ 0.8e^ + T + aT ^ 0.9e^ + aT 
= + aT — O.le^ ^ + aT — 100. (6-6) 

Hence, the contribution to (6.3) for uT > 1 is less than, recalling (6.2), 

exp(e^ + aT- lOO) = e"^°V(r) < O.lV^(r). (6.7) 

Using (6.5) and (6.7) in (6.3), we find 

i^{T) < 0.7V>(r) + O.lV’(T), (6.8) 

a contradiction. Hence T cannot exist and (6.1) holds for all t ^ 0. □ 

Proof of upper hound in (1.11). For x ^ 0 and any t ^ 0, by Lemma 6.1, 

P(Z ^ x) ^ e“**Ee*^ = ^ exp(—tx + e* + at). (6.9) 

We take t = Inx (assuming x ^ 1) and obtain 

P(Z ^ x) ^ exp(—xlnx + X + O(lnx)), x ^ 1. (6.10) 


(The optimal choice of t is actually ln(x—a), but this leads to the same result 
up to o(l) in the exponent, which is absorbed by the error term O(lnx).) □ 
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